1. BACKGROUND 


2. THE GRÖBNER INITIALIZATION 


At optical wavelengths and from the vantage point of space, the multiple-scattering 
cloud medium obscures one’s view and prevents one from easily determining which 
flashes strike the ground. However, recent investigations have made some progress 
examining the (easier, but still difficult) problem of estimating the ground flash 
fraction in a set of N flashes observed from space [Koshak (2010), Koshak and 
Solakiewicz (2011), and Koshak (2011)]. 

In the study by Koshak (2011), a Bayesian inversion method was introduced for 
retrieving the fraction of ground flashes in a set of flashes observed from a (low earth 
orbiting or geostationary) satellite lightning imager. The method employed a 
constrained mixed exponential distribution model to describe the lightning optical 
measurements. To obtain the optimum model parameters, a scalar function of three 
variables (one of which is the ground flash fraction) was minimized by a numerical 
method. This method has formed the basis of a Ground Flash Fraction Retrieval 
Algorithm (GoFFRA) that is being tested as part of GOES-R GLM risk reduction. 

Figure 1 summarizes the basic functionality of the GoFFRA, and Figure 2 highlights 
the mathematical attributes of the Bayesian retrieval process. 





INPUT: N observed flashes  ->  GoFFRA  ->  OUTPUT: fraction, $\alpha$, of the N flashes that strike the ground 

Therefore: 

# ground flashes = $\alpha N$ 
# cloud flashes = $(1 - \alpha) N$ 





Figure 1. The basic function of the Ground Flash Fraction Retrieval Algorithm (GoFFRA). 
The satellite lightning imager (e.g., OTD, LIS, or GLM) observes N flashes during time period 
$\Delta t$; the algorithm outputs the fraction of those N flashes that are ground flashes. The sample 
size N must be sufficiently large (typically 2000 or larger) to keep retrieval errors small. 


Mixed Exponential Distribution Model 

Basic Definitions: 

$x$ = Maximum Group Area (MGA) in a flash 
$y = x - 64\ \mathrm{km}^2$ (Shifted MGA) 

$N_g$ = # Ground flashes in lat/lon bin of interest 
$N$ = # Flashes in lat/lon bin of interest 
$\alpha = N_g / N$ (Ground flash fraction) 

$\mu_g$ = Population mean $y$ for ground flashes (in lat/lon bin of interest) 
$\mu_c$ = Population mean $y$ for cloud flashes (in lat/lon bin of interest) 


Distribution of MGA modeled as a Mixed Exponential Distribution: 

$p(y) = \alpha\, p_g(y) + (1 - \alpha)\, p_c(y) = \frac{\alpha}{\mu_g} e^{-y/\mu_g} + \frac{1 - \alpha}{\mu_c} e^{-y/\mu_c}$ 

Population Means of y: 

Require that: 

Bayesian Inversion 


Bayes’ Law: 


$P(\alpha, \mu_g, \mu_c \mid y) = \frac{P(y \mid \alpha, \mu_g, \mu_c)\, P(\alpha, \mu_g, \mu_c)}{P(y)}$ 


Find the parameters $v = (\alpha, \mu_g, \mu_c)$ that maximize the probability on the LHS. 


This means one maximizes the following: 

$S(v) = \ln[P(y \mid v)\, P(v)] = \ln \prod_{i=1}^{m} p(y_i \mid v) + \ln P(v) = \sum_{i=1}^{m} \ln p(y_i \mid v) + \ln P(v)$ 

Formally: 

$v$ = "Maximum A Posteriori (MAP) Solution" 


Practically: 

Use the Broyden-Fletcher-Goldfarb-Shanno variant of the Davidon-Fletcher-Powell numerical method to 
minimize $-S(v)$. Also, $P(v)$ is simplified by assuming model parameter independence, with $P(\alpha)$ 
uniform, and $P(\mu_g)$ and $P(\mu_c)$ both normal distributions. 


Estimative Initialization Scheme 

Figure 2. The upper left slide defines basic variables, and the upper right slide summarizes the Bayesian 
retrieval approach. To obtain the optimum parameters, the minimization begins by initializing the parameters as 
shown in the bottom slide [see Koshak (2011) for additional details on all of these slides]. 
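
As a rough illustration of the quantity minimized in Figure 2, the sketch below evaluates $-S(v)$ for the mixed exponential model with a uniform prior on $\alpha$ and normal priors on $\mu_g$ and $\mu_c$. The function names and the `(mean, std)` prior tuples are hypothetical choices made for this example, not part of GoFFRA; the function could in principle be handed to any BFGS routine (for example SciPy's `scipy.optimize.minimize` with `method="BFGS"`).

```python
import math

def mixed_exp_pdf(y, alpha, mu_g, mu_c):
    """Mixed exponential density p(y) for a shifted MGA value y >= 0."""
    return (alpha / mu_g) * math.exp(-y / mu_g) + ((1.0 - alpha) / mu_c) * math.exp(-y / mu_c)

def normal_logpdf(x, mean, std):
    """Log of a normal density; used for the priors on mu_g and mu_c."""
    return -0.5 * ((x - mean) / std) ** 2 - math.log(std * math.sqrt(2.0 * math.pi))

def neg_log_posterior(v, ys, prior_g, prior_c):
    """Return -S(v) for v = (alpha, mu_g, mu_c).

    prior_g and prior_c are hypothetical (mean, std) pairs for the normal
    priors; the uniform prior on alpha over [0, 1] only adds a constant.
    """
    alpha, mu_g, mu_c = v
    if not (0.0 <= alpha <= 1.0) or mu_g <= 0.0 or mu_c <= 0.0:
        return math.inf  # outside the admissible parameter region
    log_like = sum(math.log(mixed_exp_pdf(y, alpha, mu_g, mu_c)) for y in ys)
    log_prior = normal_logpdf(mu_g, *prior_g) + normal_logpdf(mu_c, *prior_c)
    return -(log_like + log_prior)
```

Returning infinity outside the admissible region is only one simple way to impose the constraints; a production retrieval would more likely reparameterize or use a bounded optimizer.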


Note that the estimative initialization scheme provided in Figure 2 involves only two 
moments (the mean and the standard deviation); it also simply initializes the ground 
flash fraction ($\alpha$) to its centerline value of 0.5. By including the third moment 
(skewness, $\gamma_1$) of the lightning optical characteristic, we arrive at a set of 3 
polynomials as shown below: 

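Set of 3 Polynomial Equations 
in 3 Unknowns $(\alpha, \mu_g, \mu_c)$: 

$\alpha \mu_g + (1 - \alpha) \mu_c = \mu$ 

$\alpha \mu_g^2 + (1 - \alpha) \mu_c^2 = \frac{1}{2}(\mu^2 + \sigma^2)$ 

$\alpha \mu_g^3 + (1 - \alpha) \mu_c^3 = \frac{1}{6}(\mu^3 + 3\mu\sigma^2 + \gamma_1 \sigma^3)$ 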
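
The factors 1/2 and 1/6 in the three moment equations can be traced to the raw moments of an exponential distribution. The short derivation below is a sketch that assumes only the mixed exponential model of Figure 2 and the standard relations between raw moments and the mean $\mu$, variance $\sigma^2$, and skewness $\gamma_1$ of $y$.

```latex
% Raw moments of one exponential component with mean \mu_k:
%   E[y^n] = n! \, \mu_k^n
% Raw moments of the mixture p(y) = \alpha p_g(y) + (1-\alpha) p_c(y):
\begin{align}
E[y]   &= \alpha \mu_g + (1-\alpha)\mu_c, \\
E[y^2] &= 2\left[\alpha \mu_g^2 + (1-\alpha)\mu_c^2\right], \\
E[y^3] &= 6\left[\alpha \mu_g^3 + (1-\alpha)\mu_c^3\right].
\end{align}
% The same raw moments written with the mean, variance, and skewness of y:
\begin{align}
E[y]   &= \mu, \\
E[y^2] &= \mu^2 + \sigma^2, \\
E[y^3] &= \mu^3 + 3\mu\sigma^2 + \gamma_1 \sigma^3.
\end{align}
% Equating the two sets and dividing by 1, 2, and 6 gives the three polynomials.
```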


Solving this system without any guesswork can be accomplished using Gröbner 
bases. [For perspective, if the equations were linear, the Gröbner basis method would 
reduce to the Gaussian elimination common in linear algebra.] We employ Mathematica 
to find a Gröbner basis of this system; the Mathematica utility is called 
GroebnerBasis, which uses an efficient version of the Buchberger algorithm to 
compute the polynomial basis. We obtain a total of 11 polynomials that define the 
Gröbner basis. Of these, we pick the three easiest to solve, which are: 


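3 (of 11) Gröbner Bases: 

$\alpha \mu_g + (1 - \alpha) \mu_c = \mu$ 

$(\mu^2 - q)\,\mu_c^2 + (r - \mu q)\,\mu_c + (q^2 - \mu r) = 0$ 

where: 

$q = \frac{1}{2}(\mu^2 + \sigma^2)$ 

$r = \frac{1}{6}(\mu^3 + 3\mu\sigma^2 + \gamma_1 \sigma^3)$ 

$(\mu, \sigma^2, \gamma_1)$ = (mean, variance, skewness) 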
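
A quick numerical sanity check of the quadratic basis polynomial is sketched below. It builds $\mu$, $q$, and $r$ from a chosen parameter triple via the three moment equations and confirms that the quadratic vanishes at $\mu_c$; by the symmetry of the moment equations it also vanishes at $\mu_g$. The function names and the test values (0.3, 5.0, 2.0) are arbitrary illustrations, not values from the study.

```python
def scaled_moments(alpha, mu_g, mu_c):
    """Return (mu, q, r), the right-hand sides of the three moment equations."""
    mu = alpha * mu_g + (1.0 - alpha) * mu_c
    q = alpha * mu_g ** 2 + (1.0 - alpha) * mu_c ** 2
    r = alpha * mu_g ** 3 + (1.0 - alpha) * mu_c ** 3
    return mu, q, r

def basis_quadratic(z, mu, q, r):
    """Evaluate (mu^2 - q) z^2 + (r - mu q) z + (q^2 - mu r)."""
    return (mu ** 2 - q) * z ** 2 + (r - mu * q) * z + (q ** 2 - mu * r)

# Arbitrary illustrative parameters: alpha = 0.3, mu_g = 5.0, mu_c = 2.0.
mu, q, r = scaled_moments(0.3, 5.0, 2.0)
residual_c = basis_quadratic(2.0, mu, q, r)  # quadratic evaluated at mu_c
residual_g = basis_quadratic(5.0, mu, q, r)  # quadratic evaluated at mu_g
```

Both residuals are zero up to floating-point rounding, which is consistent with the two population means being the two roots of the same quadratic.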


This set of 3 equations has a solution (which is also a solution to the original set of 
polynomials) given by: 


Solution (= Gröbner Initialization): 


$\mu_g =$ 
$\mu_c =$ 

where: 

$A = \mu^2 - q$, $B = r - \mu q$, $C = q^2 - \mu r$ 


This represents an analytic solution. It will be used to replace the estimative 
initialization scheme discussed in Figure 2. We expect it to improve the Bayesian 
retrieval results. 
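
One way such an initialization could be computed from data is sketched below. It is not the closed-form solution of this report; it solves the quadratic $A z^2 + B z + C = 0$ with the standard quadratic formula and recovers $\alpha$ from the first moment equation. Assigning the larger root to $\mu_g$ is an assumption made purely for illustration, and the function names are hypothetical.

```python
import math

def sample_moments(ys):
    """Return the sample (mean, variance, skewness) of the shifted MGA values."""
    n = len(ys)
    mean = sum(ys) / n
    var = sum((y - mean) ** 2 for y in ys) / n
    skew = sum((y - mean) ** 3 for y in ys) / n / var ** 1.5
    return mean, var, skew

def groebner_init(mean, var, skew):
    """Return (alpha, mu_g, mu_c) from three moments, or None if no real solution."""
    q = 0.5 * (mean ** 2 + var)
    r = (mean ** 3 + 3.0 * mean * var + skew * var ** 1.5) / 6.0
    A = mean ** 2 - q
    B = r - mean * q
    C = q ** 2 - mean * r
    disc = B * B - 4.0 * A * C
    if A == 0.0 or disc <= 0.0:
        return None  # degenerate case, e.g. a single exponential population
    root = math.sqrt(disc)
    z1 = (-B + root) / (2.0 * A)
    z2 = (-B - root) / (2.0 * A)
    # ASSUMPTION: the larger root is taken as the ground-flash mean.
    mu_c, mu_g = min(z1, z2), max(z1, z2)
    alpha = (mean - mu_c) / (mu_g - mu_c)
    return alpha, mu_g, mu_c
```

With noisy sample moments the returned $\alpha$ can fall outside [0, 1] or a root can be negative, so a practical scheme would check the result and fall back to another initialization when needed.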